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Abstract 

Search in cyclic AND/OR graphs was traditionally known to be an unsolved problem. In 
the recent past several important studies have been reported in this domain. In this paper, 
we have taken a fresh look at the problem. First, a new and comprehensive theoretical 
framework for cyclic AND/OR graphs has been presented, which was found missing in the 
recent literature. Based on this framework, two best-first search algorithms, SI and S2, have 
been developed. SI does uninformed search and is a simple modification of the Bottom- 
up algorithm by Martelli and Montanari. S2 performs a heuristically guided search and 
replicates the modification in Bottom-up's successors, namely HS and AO*. Both SI and S2 
solve the problem of searching AND/OR graphs in presence of cycles. We then present a 
detailed analysis for the correctness and complexity results of SI and S2, using the proposed 
^) . framework. We have observed through experiments that SI and S2 output correct results in 

I/"") ' all cases. 
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AND/OR graphs [Chang and Slagle 1971; Martelli and Montanari 1973, 1978; Levi and Sirovich 
1976; Nilsson 1980; Bagchi and Mahanti 1983; Pearl 1984; Mahanti and Bagchi 1985; Kumar 1991] 
are generalizations of directed graphs used in the problem-decomposition approach in artificial 
intelligence. In an AND/OR graph, a node represents a problem to be solved which can be 
decomposed into several smaller subproblems, which, in turn, may be broken down into even 
smaller subproblems and so on. The basic objective in searching an AND/OR graph is to find a 
solution graph of least cost following a cost criterion denned suitably. 

Till date, the most famous among AND/OR graph algorithms is AO* [Nilsson 1980], which 
follows the principle of best-first search and uses an admissible or lower bound heuristic. There 
has also been work on other issues, such as searching in the presence of inadmissible heuristics 
[Mahanti and Bagchi 1985; Chakrabarti, Ghosh and DeSarkar 1988] and searching in limited 
memory [Chakrabarti, Ghosh, Acharya and DeSarkar 1990]. However, a critical assumption in all 
of these work is that, the underlying AND /OR graph must be acyclic. Without this assumption - 
i.e. in the presence of cycles - the AND/OR graph search becomes much more complicated. There 
is an initial difficulty in defining a solution graph in a cyclic AND/OR graph. There are other 
problems in using an algorithm like AO* which is based on the technique of arc-marking, such as 
it may end up marking a cycle. These difficulties had led to the assumption of acyclicity, which 
permeated the whole of AND/OR graph literature and was re-emphasized in [Nilsson 1980]. 
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1 Introduction 



It can be easily observed that the assumption of acyclicity is not always acceptable in real 
life problems. For instance, in logic programming every logical equivalence represents a cycle in 
the corresponding AND/OR graph. There are several other applications where cyclic AND/OR 
graph formulations are useful, such as in assembly/disassembly sequences [DeMello and Sanderson 
1991; Jimenez and Torras 2000] and robotic task plans [Cao and Sanderson 1998]. This problem 
of searching AND/OR graphs in the presence of cycles has attracted the attention of researchers 
for a long time, and in recent times several algorithms have been reported [Chakrabarti 1994; 
Hvalica 1996; Jimenez and Torras 2000] that solve AND/OR graphs containing cycles. However 
a common issue with all of these studies is that, they do not provide a theoretical framework for 
cyclic AND/OR graphs. They are based mostly on the existing framework for acyclic AND/OR 
graphs, which makes it difficult to actually prove any of the properties of the algorithms. 

The major contributions of this paper are as follows: 1 

1. We provide a new theoretical framework for cyclic AND/OR graphs, which subsumes the 
existing framework for acyclic graphs. This framework redefines all the basic concepts such 
as solution graph, cost of solution and admissible heuristics taking into consideration the 
presence of cycles. 

2. We present two algorithms, SI and S2, for searching AND/OR graphs with cycles. Algorithm 
SI searches the implicit graph without constructing an explicit graph, while S2 is an AO*- 
style algorithm that, at each stage, calls an explicit-graph-growing outer loop and a cost- 
computing inner loop. S2 uses heuristic estimates of nodes to guide the search intelligently, 
which SI does not. SI and S2 are easy to understand and are designed using the well-known 
principle of best-first search. 

3. We discuss in detail the theoretical properties of SI and S2. These theoretical properties are 
proved using the proposed AND/OR graph framework mentioned above. 

4. Finally, the experimental results presented in this paper shed some light on the performance 
of the various algorithms both in acyclic and cyclic cases, and provide further insights into 
the development of algorithms for AND / OR graphs. 

2 Definitions and Previous Work 

In this section, we make a brief survey of the existing literature on AND/OR graphs. 2 
2.1 Definitions 

An AND/OR graph G is a directed graph where a node represents a problem to be solved, and 
its immediate successor nodes represent the subproblems into which the parent problem can be 
transformed or decomposed. G contains a special node, the start node, that represents the initial 
(root) problem to be solved. G also contains a set of leaf nodes, which are of two types: terminal 
and nonterminal. While the terminal leaf nodes represent subproblems with known solutions, the 
nonterminal leaf nodes represent subproblems which are not solvable. Each node has finitely many 
children. Any node can be either an AND node or an OR node. An OR node can be solved by 
solving any of its children, while an AND node can be solved by solving all of its children. Without 
any loss of generality we assume that all leaf nodes are OR nodes. (It is important to note that 
these definitions of AND and OR nodes are in line with [Pearl 1984] and [Martelli and Montanari 
1973, 1978], but not same as in [Nilsson 1980], which uses the concept of k- connectors.) 

1 Our study involves the search for solution graphs that do not contain cycles. This is the conventional direction 
of AND/OR graph work, as orthogonal to a recent study [Hansen and Zilberstein 1998] in which the authors work 
on a variation of the conventional model that allows cycles in the solution graphs. 

2 Any term used here but not explicitly defined will follow the meaning as given in [Nilsson 1980]. 



We represent the start node by s, the terminal leaves by t, ti, £2, ... and all other nodes by m, 
n, p, q, r,.... For the sake of completeness, we allow the start node s to be either a terminal leaf, 
or a nonterminal leaf, or any internal node of G. The set of terminal leaves is denoted by T, and 
the set of nonterminal leaves by NT. Thus T (J NT is the set of all leaf nodes in G. 

Each arc in G represents the application of a production rule. Generally a cost is associated 
per rule application - thus each arc (m,n) of G has a cost c(m,n) > 5 > associated with it, 
where 5 is a small positive number. 

Let G be an acyclic AND/OR graph and m be a node in G. A solution graph D{m) rooted 
at or below m is a finite subgraph of G that represents a complete solution to m. It is defined as 
follows: 

1. m G D{m); 

2. if n is an OR node in G and n is in Dim), then exactly one of its immediate successors in 
G is in D(m); 

3. if n is an AND node in G and n is in D(m), then all of its immediate successors in G are in 
D(m); 

4. every maximal directed path in D{m) ends in a terminal leaf node. 

Note that there may be multiple solution graphs rooted at a node, and each solution graph 
satisfies the definition of an AND/OR graph. A function h(n, D(m)) assigns a cost value to each 
of the nodes n in D(m) as follows: 

1. h(n, D{m)) = if n is a terminal leaf node; 

2. h(n, D{m)) = c(n, n') + h(n', D{m)) if n is an OR node and n' is its immediate successor in 
D(m); 

3. h(n,D(m)) = X]f =1 [c(n, rii) + h(n i: D(m))\ if n is an AND node with immediate successors 
rii, .., nfc in D{m). 

Thus, /i(m, D(m)) is the cost of a solution graph D{m) below m. If m has one or more solution 
graphs below it, then the cost of a minimal-cost solution graph is denoted by h*(m). If m has 
no solution graph below it, h*(m) is taken to be oo. Thus h*(s) is the cost of a minimal cost 
solution graph below s. A search algorithm is required to output a solution graph below s with 
minimal cost. The definition of cost used here represents the sum- cost criterion; there is another 
alternative, called the max-cost criterion which differs in the way that the cost of an AND node 
n is evaluated as the maximum of c(n, rij) + h(ni, D(m)), evaluated over all its children n,. 

As described above, an AND/OR graph is implicitly defined by the root node s, set of produc- 
tion rules and their costs, a heuristic function, and a set of terminal and nonterminal leaf nodes. 
This is called the implicit graph, G, and a search algorithm usually works by constructing a sub- 
graph of the implicit graph, called the explicit graph G' . Initially only the root node s belongs to 
the explicit graph G' . Once s is expanded, its children and all their connecting arcs are added to 
G' . G' grows as more and more nodes are expanded and new nodes and arcs are added to it. At 
any instant the nodes of G' which have no children are called tip nodes. 

In the study of acyclic AND/OR graphs, the notion of a potential solution graph (psg) of an 
explicit graph is very similar to the notion of a solution graph of an implicit AND/OR graph. A 
psg D'{m) below a node m in G' is a finite subgraph of G' with the following properties: 

1. me D'(m); 

2. For every node n E D'(m) which is not a tip node of G'\ 



(a) exactly one of the immediate successors of n in G' is in D'{m) when n is an OR node; 

(b) all of its immediate successors in G' are in D'(m) when n is an AND node; 

3. every maximal directed path in D'{m) ends in a tip node of G' . 

Occasionally a non-negative heuristic function h(n) is defined for each node n in G. The value 
of this heuristic function is an estimate of the cost of solving that node, and is used in search 
algorithms to guide the search process towards least-costly solution graphs. A heuristic function 
is called admissible if h(n) < h*(n)\/n e G. The heuristic estimate is for a terminal leaf node 
and oo for a nonterminal leaf node. 

The cost of a node n in a psg D'(m), denoted as h(n, D'(m)), is defined in an identical 
manner as the cost of n in a solution graph D(m), with the first condition being replaced by 
h{n, D'(m)) = h(n) if n is a tip node in G '. The cost of a minimal-cost psg below node n in G' is 
denoted as h'(n). 

2.2 Algorithms for Acyclic AND/OR Graphs 

One of the early algorithms on AND/OR graphs was the Bottom-up algorithm [Martelli and 
Montanari 1973]. This algorithm, which is an extension of the shortest-path algorithm [Dijkstra 
1959], operates on the (entire) implicit AND/OR graph and evaluates the nodes according to a 
dynamic schedule determined by the cost-dominance of nodes. It was followed by an improved 
algorithm, HS [Martelli and Montanari 1978], that takes heuristic information into account and 
works by creating an explicit graph. HS is a top-down iterative method that first constructs an 
explicit graph G' with only node s. In each iteration, HS chooses a tip node of G' (initially the 
start node s) for expansion, and adds its children with their connecting arcs to G' . Then a bottom- 
up cost revision process is performed, whereby at each OR node one of its least-costly children is 
chosen and the corresponding arc is marked; at each AND node all the children are chosen and 
their arcs are marked. At the end of this process, below every node n in G', a complete psg D'{n) 
is marked. This marked psg D'{n) is also a least costly psg below n. In this manner the iterations 
continue, until it is found that the marked psg below s is a solution graph which is then outputted 
by HS, or the marked psg contains a nonterminal leaf node when failure termination is reported. 
Algorithm HS was modified by introducing the concept of /c-connectors and renamed as algorithm 
AO* [Nilsson 1980]. AO* has been traditionally used to find minimal-cost solutions to AND/OR 
graphs. 

It was proved by [Martelli and Montanari 1978] that HS outputs minimal-cost solutions if the 
heuristic function satisfies the monotone restriction. [Bagchi and Mahanti 1983] generalized this 
result and showed that HS output minimal-cost solutions even if the heuristic function merely 
satisfies the weaker condition of admissibility. This generalization easily carries over to AO* also. 

However, as has been widely documented, AO* can fail if the AND/OR graph contains cycles, 
which we illustrate in Figures 1(a) and 1(b). In both figures, t is a terminal leaf node and marked 
arcs are crossed by a line. 

In the explicit graph shown in Figure 1(a), AO* expands nodes s, p and q in the first three 
iterations and then expands r. Now during the bottom-up computation, it gets stuck with nodes 
p and q. The bottom-up computation uses a list Z [Nilsson 1980, pp 104, Step 10] to ensure 
that the nodes being evaluated are selected in a topological order. However in this example, after 
expansion of r both p and q will be present in Z and none can be selected for evaluation as it is 
the predecessor of the other. Hence AO* gets stuck, even though there is a solution graph of cost 
14 below s. 

In the explicit graph shown in Figure 1(b), on the other hand, AO* first expands the root node 
s and marks the arc (s,p) since p is the least-costly child below s. Then, after expanding the node 
p, it marks the arc (p,p). Now, during the bottom- up cost revision process it tries to go upward 



Figure 1: Problems with AO* in case of cycles 
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following all marked arcs above p, thereby entering into an infinite self-loop. However, there exists 
a solution graph from s to the terminal leaf t with a cost of 3. 

These examples clearly show that in the presence of cycles, AO* may never even terminate, 
let alone find a minimal-cost solution. 

2.3 Algorithms for Cyclic AND/OR Graphs 

The first attempt in searching cyclic AND/OR graphs was made in 1994 when two algorithms, 
Iterative_revise and REV* were presented [Chakrabarti 1994]. While Iterative_revise is a top-down 
recursive algorithm for searching AND/OR graphs, REV* is a strictly bottom- up algorithm that 
uses parent-pointers and has a better performance than Iterative-Revise. Its operation is briefly 
described below. 

REV* starts searching the AND/OR graph by putting all its leaf nodes in a list called OPEN 
and assigning heuristic values to them. It then does the following work iteratively: takes out a 
least-costly node from OPEN, assigns cost values to its parent nodes by using parent-pointers and 
goes up the graph as long as the siblings of the current level node have all been evaluated. When 
it gets stuck in this upward phase, it again selects a node from OPEN and starts another upward 
phase to evaluate nodes. In this way REV* continues until the start node has been evaluated. 
Cycles get eliminated due to the cost dominance rule, and REV* outputs a correct solution cost 
whenever there is a solution graph. 

REV* is a simple algorithm that solves the long-standing problem of searching explicit AND/OR 
graphs in the presence of cycles. Recently, it has been shown [Jimenez and Torras 2000] that it 
is possible to improve the efficiency of REV* significantly by making some modifications. These 
modifications derive partly from an earlier work in the acyclic domain to improve the efficiency of 
AO* namely, algorithm CF [Mahanti and Bagchi 1985]. CF's control structure for node expansion 
has been utilized in [Jimenez and Torras 2000] in their algorithm INT. As the authors observe, 
INT's "top-down search strategy is based on Mahanti and Bagchi's CF, whereas its bottom-up cost 
revision process is inspired in Chakrabarti's REV*." The bottom-up cost-revision process of INT 
is primarily based on REV*; but in the process, it also employs the superior cost-updation strategy 
of CF. This makes INT a feasible alternative for searching cyclic AND/OR graphs. However, it 
still has the inefficiency that nodes are considered for cost-revision even when they are not likely 
to be affected as a result of the current node-expansion. As it is well-known in AO* and CF-like 
algorithms, it is sufficient and economical to visit only nodes whose costs, arc markings or solved 
status change as a result of the expansion of a new node. This observation has been implemented 
in their next algorithm CFCrev* by creating the OPEN list with only a subset of the leaf nodes. 
As the OPEN list ultimately decides which nodes are going to be visited in the cost-revision phase 
of the algorithm, this strategy significantly cuts down the number of nodes evaluated, particularly 
when the percentage of AND nodes is high. CFCrev* has been implemented and found to be 
very efficient compared to its predecessor REV*. However, the CFCrev* algorithm is unwieldy 
and non- intuitive. 
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Figure 3: Combining solution graphs may lead to cycles 
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Along with the work on REV* and CFCrev*, one other attempt [Hvalica 1996] has been 
made to solve cyclic AND/OR graphs. In Hvalica's algorithm, a special technique is used for loop 
avoidance. When a node is expanded a dummy node Xf is attached to it with a high heuristic value. 
If expansion of the current node creates a cycle, the dummy node Xf offers an alternative route 
to come out of it. The exit from the cycle occurs when the cost of the expanded node, computed 
through its children, exceeds the high cost of the dummy child Xf. This method, although quite 
interesting, may become inefficient in practice. 

2.4 Motivation for current work 

Although the recent algorithms on cyclic AND/OR graphs as discussed in the preceding subsection 
are claimed to be correct, their correctness proofs stand on a weak theoretical base. This is due to 
the reason that the theoretical constructs such as solution graph, potential solution graph, cost of 
solution etc. which are fundamental to the AND/OR graph algorithms and their properties, have 
never been properly re-established in the context of cycles. For instance, if we look at Figure 2, 
we find this interesting question: does node p have a solution graph below it? Looking from the 
top (i.e. from start node s) it would appear not (as that would create a cycle with predecessor 
s) ; however looking from the node p itself there is a solution graph below it passing through node 
s. So how do we construct a solution graph in the presence of cycles? How do we define the cost 
function appropriately? These questions have never been adequately addressed in the literature. 
The recent papers on cyclic AND/OR graphs [Chakrabarti 1994; Hvalica 1996; Jimenez and Torras 
2000] seem to have completely ignored this issue and worked with cyclical constructs. 3 

The graphs in Figure 3 give another example of the necessity of theoretical support for cyclic 
AND/OR graph algorithms. The graphs in Figure 3(a) and 3(b) are solution graphs below p and 
q respectively, in the implicit graph of Figure 1(a); however if they are combined below s as in 
Figure 3(c), they clearly do not form a solution graph below s. What is more fatal is that they 
create a cycle. The question then arises: can such solution graphs be at all combined (in an 
appropriate way) to form a solution graph below the parent? We delve deep into these issues in 
this paper. 

3 Examples of such cyclical definitions are presented in the Appendix for the purpose of review. 



We first provide a theoretical framework for cyclic AND / OR graphs, which forms the concep- 
tual basis for our algorithms. We then present two algorithms, SI and S2, which are based on the 
time-tested principle of best-first search (the previous algorithms do not seem to have adhered to 
this principle). The analysis of SI and S2 have been done in detail using our theoretical framework. 
We conclude the paper with some comments on the results of our detailed experiments. 

3 Framework for Cyclic AND/OR Graphs 

In this section we generalize the existing AND/OR graph search framework for graphs containing 
cycles. In the proposed new framework, the concept of Maximal Extendable Subgraph (MES) 
plays a pivotal role. 

An MES below a node is constructed in a top-down manner as explained below. During the 
construction, at an OR node x we take one child and include it if it does not form a cycle with the 
part of the MES constructed so far. If x has no child, or if the selected child forms a cycle, the 
construction ceases to proceed below x. When x is an AND node, construction continues below x 
only if no child of x forms a cycle with the part of the MES constructed so far. 

3.1 MESs in an Implicit AND/OR Graph 

Definition 3.1 For any AND/OR graph Q (implicit, explicit or other) we define the following: 

i) For any node p in Q, the childset is T(p, Q) = {q \ q is a child of p in Q}. 

ii) The set of all nodes of Q which have no children is Zq = {x \ x G Q /\T(x, Q) = <p}. 

For example, a leaf node, say x, of an implicit graph G is one for which T(x, G) = <p. Again, 
in G, Zq = T U NT = set of all terminal and nonterminal leaf nodes. 

Definition 3.2 Let G be an implicit AND/OR graph andn be a node in G. A Maximal Extendable 
Subgraph (MES) M(n,G), rooted at or below n, is defined as follows: 

i) n G M(n,G) 

ii) For every node x G M(n, G) which is an internal node in G: 
Let yi, y2,---,Vk be the children of x in G. Now, 

a) If x is an OR node, then select any one child yi, 1 < i < k. Include yi in M{n,G) 
if it is not same as x or any predecessor of x in M{n,G). Otherwise the construction 
ceases to proceed below x. 

b) If x is an AND node, then include all y i} 1 < i < k, if none of them is same as x or 
any predecessor of x in M(n, G). Otherwise the construction ceases to proceed below x. 

For any node n G G, the MESs below it are enumerated as iWi(n, G), M 2 (n, G), . . .. On the 
other hand, occasionally we may write M(n, G) as M{n) or just as M, when the parameters are 
clear from context. 

Remark 3.1 

(i) If G contains paths of infinite length, then it is clear from the construction that there may be 

MESs with infinitely many nodes and arcs. 

(ii) From the definition, it is clear that MESs cannot contain cycles. 
(Hi) There must exist at least one MES below every node in G. 



Figure 4: An Implicit Graph and its MESs 
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Figure 5: Duplicate MESs below a node 




An AND/OR graph G and all of its MESs are shown in Figure 4. It is interesting to note 
that some MESs may appear to be duplicate - for instance, the MESs M 3 (s, G) and M 5 (s, G) may 
appear to be the same even though they are actually different. This happens due to the presence 
of cycles, as is illustrated in Figure 5. In this figure, the dotted arrows represent the different 
children that these MESs selected at node n. But in either case, the selected child created a cycle 
and the MES was terminated at node n. Thus even though these two MESs appear to be same, 
they attempted to include different children below a node and hence are distinct. It may also be 
noted that an MES may appear to be a subgraph of another MES. Thus, M 3 (s, G) appears to be 
a subgraph of M 4 (s,G) in Figure 4. This is also explained by the presence of cycles. M 3 (s,G) 
and M 4 (s,G) have chosen different children, namely n and r, below node n - the former leading 
to a self-loop (and hence terminating the MES) and the latter continuing to node r. Thus the two 
MESs are distinct and none is a subgraph of the other. 

The question naturally arises: what can be the upper bound of MESs in a graph containing 
N nodes? It may be easily verified that the number of MESs will be maximum if (a) the graph 
contains only OR nodes, thereby providing more choices at every node, and (b) the graph is 
structured as in Figure 6 (this graph is the particular instance for N = 3). Here, the number of 
MESs with k nodes, 1 < k < N, is k x N P k , and the total number of MESs is £f=i k x N P k . 

Definition 3.3 Let M be an MES below a node n in G. Then below every node x G M we define 
a sub-MES £(x, M) of M identically as M is defined in G, by replacing M with £ and G with M 
everywhere. 

An AND/OR graph G and some of its MESs and their sub-MESs are shown in Figure 7. 
Remark 3.2 

(i) A sub-MES may or may not be an MES itself. This is shown in Figure 7. The sub-MES 
Mi(s, G)), coming from Mi(s,G), is not an MES in G (it would have been an MES if 
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the arc (x,n) was present, as in M\{p,G)). However, the sub-MES M\{n, G)), coming 
from Mi(n, G) is an MES in G. In general, if a sub-MES is rooted at the same node where 
the MES is rooted, the sub-MES will be an MES in G. Sub-MESs rooted at other nodes in 
the MES may or may not be MESs themselves in G. 

(ii) There is exactly one sub-MES below every node in an MES. 

3.1.1 Classification of MESs in Implicit Graphs 

Depending upon the types of node (i.e. terminal leaf, nonterminal leaf, or other) that the maximal 
paths of an MES terminate in, it is classified into different types as follows. Note that, for any 
MES M, Z M represents the set of all nodes which have no children in M. 

Definition 3.4 An MES M(n, G) below a node n e G is said to be a 

i) type-I MES (or, a solution graph,), if Z M { n ,G) Q T; 

ii) type-II MES, if 

a) Z M{n)G) C Z G , and 

b) Z M{n>G) r\NT^<f>; 

Hi) type-Ill MES, if Z M{n , G) £ Z G . 

The different types of MESs capture the information whether all the paths end in terminal leaf 
nodes, or some end in nonterminal leaf nodes, or some paths even get stuck in cycles on the way. 
The type numbers I, II and III are a shorthand way of depicting the nature of an MES. 

For example, for the implicit graph G given in Figure 8(a), four MESs below n are shown in 
Figure 8(b)-8(e). These MESs are of types I, II, III and III respectively. In Figure 8(f), Mi(p, G) 
is a type-I MES below p that passes through n. It is interesting to note that n is a predecessor of 
p in G. In Figure 8(g), Mi(q,G) is a type-Ill MES due to the self-loop at q. In Figure 8(h), the 
subgraph is not an MES, nor is it a sub-MES of M 3 (n, G). 

3.1.2 Classification of Nodes in Implicit Graphs 

A node in an implicit graph is classified into different types, depending on the type of MESs below 
it. 

Definition 3.5 A node n in G is said to be of 

i) type-I, if there is a type-I MES below n in G. 

ii) type-II, if there is no MES of type-I, but at least one MES of type-II below n in G. 
Hi) type-Ill, if there is no MES of type-I or type-II below n in G. 

Remark 3.3 

(i) We classify the nodes as above to highlight the information contained in them about their 
solvability, or the reasons for not being solvable. A type-I node is a solvable one (i.e. it 
contains a solution graph below it), a type-II node is not solvable as each of its MESs 
contains one or more nonterminal leaves which are known to be unsolvable, and a type-Ill 
node is not solvable as each of its MESs gets stuck at some internal node(s) of G due to 
cycles. Thus there is a distinct difference between a type-II node and a type-Ill node in G. 



igure 8: An implicit graph with some MESs 
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Figure 9: Three implicit graphs and their node types 
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(a) Implicit Graph G\ 




(b) Implicit Graph G2 
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[c) Implicit Graph G3 



(ii) If the graph G is acyclic, there cannot be any type-Ill nodes in G. However, even if the graph 
is cyclic, there may not be any type-Ill nodes in it, as is evident from Figures 9(a) and 9(b). 

For example, in Figures 9(a) to 9(c) we show three implicit graphs G\, G2 and G3 and label 
each node with its type (I, II or III). Observe that, changes from G\ to G 2 happen because x has 
been made an AND node, and changes from G 2 to G3 happen because p has also been made an 
AND node. 



3.1.3 Properties of MESs 

Now we discuss the inter-relationships between the different types of nodes and different types of 
MESs. These results are the important building blocks in this proposed new framework for cyclic 
AND/OR graphs. 



Lemma 3.1 For any MES M(n, G): 



Figure 10: Illustration of Lemma 3.2(iii] 
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IfM(n,G) is of type- 1, then for every p G M(n,G) the sub-MES M(n,G)) is also an 
MES, and it is of type-I. 

ii) If M(n,G) is of type- II, then for every p G M(n,G) the sub-MES £(p, M(n,G)) is also 
an MES, and it is of type-I or type-II. 

Proof. Clear. □ 

Lemma 3.2 Let M be an MES below a node n G G and p be any node in M. Now, 

i) If M is of type-I then p must be of type-I. 

ii) If M is of type-II then p must be of either type-I or type-II, and at least one such p is of 
type-II. 

Hi) If M is of type-Ill then p may be of type-I or type-II or type-Ill. 

Proof. Clear. □ 
As an example of Lemma 3.2(m), we refer to Figure 10. Even though the MES Mi(s,Gi) in 
Figure 10(b) is of type-Ill, all of its nodes s and n are of type-I in G\ in Figure 10(a). Again, 
MES Mi(s, G2) in Figure 10(d) is of type-Ill, but all of its nodes s and n are of type-II in G2 in 
Figure 10(c). It should be noted that the type of a node is defined globally based on all the MESs 
below it, but the type of an MES M is based on the nodes of Z M - whether they are terminal, 
nonterminal of internal nodes of G. 

Theorem 3.1 Sub-problem Composition Theorem for Implicit Graphs. Let G be an im- 
plicit AND/ OR graph and n be any internal node in G. Now, 

i) If n is an OR node, then: 

(a) n is of type-I iff at least one child of n is of type-I; 

(b) n is of type-II iff no child of n is of type-I but at least one child is of type-II 

(c) n is of type-Ill iff all children of n are of type-Ill. 

ii) If n is an AND node, then: 

(a) n is of type-I iff every child of n is of type-I; 

(b) n is of type-II iff at least one child of n is of type-II and no child is of type-Ill; 

(c) n is of type-Ill iff at least one child of n is of type-Ill. 

Proof. Similar to the proof of Theorem 3.2 on Explicit Graphs, presented later. □ 



Figure 11: Two implicit graphs 



(a) Implicit Graph d 




(b) Implicit Graph G 2 



Figure 12: MESs from the implicit graphs of Figure 11 




(a) M^d) 
(type-I) 



(b) M 2 (q,G 2 ) 
(type-II) 




(c) M 1 (s,G 2 ) 
(type-Ill) 



3.1.4 Costs of MESs in Implicit Graphs 

Finally we come to the notion of costs. For AND/OR graphs with cycles, we define the cost with 
respect to an MES. 

The definitions in this section will be illustrated using the graphs shown in Figures 11 and 
12. These graphs show the arc-costs beside each arc and the heuristic values in parenthesis beside 
each node. The heuristic values of terminal leaf nodes are assumed to be zero. These heuristic 
values will be useful in later discussions on explicit graphs, where the same figures will be referred. 

Definition 3.6 Given any implicit graph G, for any MES M(n, G), we denote the cost of a node 
u in M(n,G) by f3(u,M(n,G)). We define this cost function (3(u, M(n,G)) as follows: 

(3(u,M(n,G)) 
= if u G T; 
= oo if u E NT; 

= c{u,w) + P(w,M(n,G)), if u is an OR node and w is the child of u in M{n,G) 
= Y.weT{u,M{n,G)){c{u,w) + P(w, M(n,G))} if u is an AND node. 

P(u,M(n,G)) is undefined if u e Z M \ {T\JNT}. 

In Figure 12(a), (3(h, M^s, d)) = 0, (3(p, M^s, d)) = 5, (3(r, M^s, d)) = 6, (3(q, M^s, d)) = 
7 and 0(s, M^s, d)) = 14. Similarly 0(t 2 , M 2 (q, G 2 )) = 0, 0(y, M 2 (q,G 2 )) = oo (since y G NT), 
(3(x,M 2 (q,G 2 )) = oo and (3(q, M 2 (q,G 2 )) = oo. In Mi(s,d), Mi(s, G 2 )) = 0. But since 
/3(r, Mi(s, d)) is undefined, the /5-values of p, q and s are also undefined. 

Definition 3.7 Given any implicit graph G, let Mi(n, G), M 2 (n, G), . . ., be the all possible type-I 

or type-II MESs below n. Thenh*{n), the cost of a minimal- cost MES below n = glbi>i{j3(n, Mj(n,G))}; 

if no type-I or type-II MES exists below n, h*(n) is undefined. 



It may be easily verified that in Figure 11(a), h*(s) = 14, and the only minimal-cost MES is 
Mi(s, G\). In Figure 11(b) h*(s) is undefined, while h*(q) = oo. 

Remark 3.4 For any node n in an implicit graph G, 

(i) if n is of type-I, then h*(n) < oo; 

(ii) if n is of type-II, then h*(n) = oo; 

(in) if n is of type-Ill, then h*(n) is undefined. 

Lemma 3.3 Let M(n) be a minimal-cost MES below a type-I or type-II node n in G. Then for 
every node x E M{n), the sub-MES £(x, M(n)) is also a minimal-cost MES below x. 

Proof. Clear. □ . 

3.2 MESs in an Explicit AND/OR Graph 

The concepts of MES and sub-MES, and the costs and properties of an MES, have been discussed 
in detail for an implicit AND/OR graph. Now in the context of an explicit AND/OR graph, we 
present similar concepts. 

In an explicit graph G', the child_set is defined identically as in Definition 3.1. Zq> represents 
the set of all tip nodes in G", i.e. Z G < = {x | x E G' f\(T(x, G') = 0)}. 

Definition 3.8 An MES M(n, G') in an explicit graph G' is defined similarly as in the implicit 
graph G, with G replaced by G' throughout. 

As in the case for implicit graph, multiple MESs below a node n E G' are named as Mi(n, G"), 
M 2 (n,G') .... 

Definition 3.9 A sub-MES ^(p, M(n, G')) below a nodep in an MES M(n, G') is defined similarly 
as a sub-MES below a node p in an MES M(n, G) . 

Remark 3.5 

As explained in Remark 3.2 using Figure 7 for implicit graphs, in case of explicit graphs as well 
a sub-MES may not itself be an MES. In this context, we may consider the entire implicit graph 
in Figure 7(a) as an explicit graph. Then the sub-MES Mi(s, G')), obtained from M\(s,G') 
will not be an MES in G' . 

3.2.1 Classification of MESs in Explicit Graphs 

In case of an implicit graph, the MESs could be distinguished into type-I, type-II and type-Ill, 
as they are fully extended up to the leaf nodes of G, or immediately prior to the formation of a 
cycle. In an explicit graph, those MESs which have encountered cycles are labelled as type-Ill. 
On the other hand, those MESs which are yet to encounter cycles cannot be labelled as type-I or 
type-II. This is because, such MESs may not be fully extended up to the leaf nodes of G yet. All 
such MESs are collectively labelled as non-type-III. A non-type-III MES is identical to a potential 
solution graph (psg) defined in the context of an acyclic AND/OR graph. In this paper, we shall 
use these two terms interchangeably. 

Definition 3.10 In an explicit graph G' , an MES M(n,G') is said to be a: 

i) non-type-III MES (or a potential solution graph or psg), if Z M ( n ,G') Q Z G r } 



Figure 13: Two explicit graphs and their MESs 
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it) type-III MES, if Z M{ n,G>) £ Z G ,. 

For example, consider the explicit graph G[ in Figure 13(a), obtained by expanding nodes s, 
n and p from G in Figure 8(a). We show a psg, Mi(n, G[) below n in Figure 13(b). When the 
explicit graph is augmented by expanding node x to form G' 2 , the corresponding MES Mi(n, G' 2 ), 
however, becomes type-III (shown in Figure 13(d)). Note that Mi(n, G[) and Mi(n, G 2 ) look quite 
similar although they are actually different. In ilfi(n, G[) a; is a tip node, while in Mi(n, G' 2 ) x is 
an expanded node. 

3.2.2 Classification of Nodes in Explicit Graphs 

The nodes in an explicit graph are classified into two types depending on the type of MESs below 
them. 

Definition 3.11 In G' , a node is called non-type-III if it has a non-type-III MES (or psg) below 
it, otherwise it is called type-III. 

For example, in Figures 13(a) and 13(c), both p and n remain non-type-III before and after 
expansion of x. In both G[ and G' 2) n has a non-type-III MES through q, while p has a non-type-III 
MES through {x,t} in G[ and through {x,t,n,p} in G' 2 . 

3.2.3 Properties of MESs in Explicit Graphs 

The MESs in an explicit graph follow the similar properties as do MESs in an implicit graph. 

Lemma 3.4 IfM(n, G') is of non-type-III, then for every p G M(n, G'), the sub-MES M(n, G')) 
is also an MES and it is of non-type-III. 

Proof. Clear. □ 



Lemma 3.5 Let M be any MES below a node n G G' . Now if M is of non-type- III, then every 
p G M is of non-type-III. 

Proof. Clear from Lemma 3.4. □ 

Lemma 3.6 Let xi and X2 be two acyclic AND/OR graphs with node and edge sets V(xi), E(xi) 
and V{x2), E(x2) respectively. Let V(xi) H V{x2) = (ft- Now, if edges {e\, e<i, e n } are 
added from nodes in V(xi) to nodes in V{x2), then the resulting graph with V = V(xi) U V{x2), 
E = E(xi) U E(x2) U {ei, e 2 , . . . , e n } is acyclic. 



Proof. Clear. □ 

Theorem 3.2 Sub-problem Composition Theorem for Explicit Graphs. Let G' be an 

explicit AND/OR graph and n be any internal node in G' . Now, 

i) If n is an OR node, then: 

(a) n is of non-type-III iff at least one child of n is of non-type-III; 

(b) n is of type-Ill iff no child of n is of non-type-III. 

ii) If n is an AND node, then: 

(a) n is of non-type-III iff every child of n is of non-type-III 

(b) n is of type-Ill iff at least one child of n is of type-Ill. 

Proof. 

i) (a) 

Let p be a child of n in G' . We assume p is of non-type-III, and M(p) is a non-type-III MES 
below p. We will show that n is of non-type-III. 

Case I: n ^ M(p): Let M{n) be the MES, created by joining the arc (n,p) to M(p). Then 
Zm(u) — Zm( p ) Q Zg>- Then n is also of non-type-III. 

Case II: n G M(p): Since M(p) is of non-type-III, by Lemma 3.5 n must be of non-type-III. 

Since it is given that n is of non-type-III, there must be a non-type-III MES M(n) below n. 
Let p be the child of n which belongs to M(n). Clearly p is of non-type-III (by Lemma 3.5). 

(b) Clear from (a). 

ii) (a) 



Every child of n is of non-type-III. We have to show that n is of non-type-III. 

Let pi,p 2 , ...,Pk, k > 2, be the children of n. Let M{j>j) be a non-type-III MES below pj. 
Clearly, n ^ M{pj)\/j ) otherwise M(pj) would contain a cycle (since n is an AND node and 
must include its child pj in any MES), thereby violating the acyclicity property of an MES. 

Now, for proving that n is of non-type-III, it is sufficient to construct a non-type-III MES 
M(n) from the given non-type-III MESs M(j>j) and the arcs (n,pj), 1 < j < k. 



We construct M(n) by first taking the nodes and arcs of M(pj)s as follows. First, the MES 
M(pi) is taken in its entirety. We call this Q\. Next, we augment Q\ with nodes and arcs 
from M(p 2 ). The selection of nodes and arcs from M(p 2 ) is a recursive process that starts 
at p 2 . If P2 £ Qii we stop there and there is nothing to select from M(p 2 ). Otherwise, 
we traverse the MES M(p 2 ) in a depth-first manner, selecting nodes and arcs on the way, 
until we arrive either at a tip node of M(p 2 ) or at a node q in Qi. In the former case, the 
selection process stops by selecting the tip node of M(p 2 ); in the latter case, it stops at 
the node q in Q 1 . The nodes and arcs that are selected from M(p 2 ) are added with Qi, to 
form an augmented graph Q 2 . Next, we take Q 2 and add nodes and arcs from M(p 3 ) in a 
similar manner, to form Q3. The process goes on like this, until we have considered each 
of the M(pj)s and come up with a graph Qk- We now present the procedure for the said 
construction of Qk- In this construction, V denotes the set of all nodes, and E the set of all 
directed arcs of an AND/OR graph. 

Procedure Construct_Qk 

1. Set Q 1 = M( Pl ) (i.e. V{Q{) = V(M( Pl )) and E{Q X ) = E(M(p 1 ))); 

2. For j — 2 to k, construct Qj as follows: 
Initialize: V(Qj) = V{Q^) and E(Qj) = E(Q j ^ 1 ); 
CallAdd^); ' 

Procedure Add(x) 

If x i V(Qj) 

V(Qj) = V(Qj) U {x}; 
If x has no children in M(pj) 
Return; 

Else 

For every child y of x in M(pj) 
Begin 

E(Q j ) = E(Q j )U(x,y); 
Call Add(y); 

End 
Return; 



Finally, we add the node n, and each of the arcs (n,pj), 1 < j < k, to Qk, which gives us a 
structure called M(n). We claim that M(n) is a non- type-Ill MES below n. This is proved 
through the following three propositions. 

Proposition 1. Qj is acyclic, 1 < j < k. 

The proof is by induction on j. For j = 1, we have Q\ = M{pi) which is an MES and must 
be acyclic. 

Let us assume that all the graphs up to Qj~i are acyclic. Now Qj is constructed from 
Qj-i and M(j)j) which are both acyclic; moreover we do not include any arc from a node 
in V(Qj-\) to a node in V(M(pj)) \ V(Qj-i). Thus it follows from Lemma 3.6 that Qj is 
acyclic. 

Hence Q k is acyclic. 

Proposition 2. For every node m in Q k , Qk contains a non-type-III MES below m. 
The proof is by double induction, first on j and then on the nodes of Qj. 
For j = 1, the proposition holds for all nodes of Q±, by Lemma 3.4. 



Next we assume that the proposition holds up to Qj-i- We show that it also holds for Qj. 

Let V = V(Qj) \ V(Qj-i). Let m 1 , m 2 , . . . , mi — pj be the nodes of V, sorted topologically. 

mi is a tip node: Then the MES below m\ is m\ itself and is clearly of non-type-III. 

m\ is not a tip node: The children of m\ in M(pj) (1 child if mi is OR node, all children 
if mi is AND node) must belong to Qj-i- Since by hypothesis, every node in Qj-i has a 
non-type-III MES below it, and Qj is acyclic, it is clear that m x will have a non-type-III 
MES below it via its children in Qj-i. 

Thus the proposition holds for mi. Let us assume that it holds for m*, % > 1. We would 
show that it holds for m i+ i. 

m.j + i is a tip node: Clear. 

m.j + i is not a tip node: Since by hypothesis, every child of m.j+1 in Qj has a non-type-III 
MES below it, and Qj is acyclic, it is clear that m i+ i will have a non-type-III MES below it 
via its children in Qj. 

Thus the proposition holds for every node in Qj. 
Proposition 3. M(n) is a non-type-III MES. 

Observe that, M{n) has been obtained by using the last Qj, i.e. Qk and adding the node 
n to it via the arcs (n,pj), 1 < j < k. Clearly, as n ^ M(P,)Vj, this resulting structure is 
acyclic. Thus n will have a non-type-III MES through its children pi,p 2 , ■ ■ ■ ,Pk- 

This proves that n is a non-type-III node. 



n is a non-type-III node. Let M(n) be a non-type-III MES below it, and Pi,P2, ■ ■ ■ ,Pk be 
the children of n. Then by Lemma 3.4, the sub-MES below every pj is a non-type-III MES, 
and every pj must be of non-type-III. 

(b) Clear from (a). □ 



The importance of the Sub-problem Composition Theorems lies in the fact that they express 
the general notion of obtaining the solution graph at a node by using the solution graphs of its 
children; the fact that this can be done at all, even in the presence of cycles as proved in the 
theorems above, will be used in our correctness proofs of the algorithms SI and S2. Because of 
their fundamental importance, we illustrate the construction process used in the proof (routine 
Construct_Qk) in Figure 14 below. 

Consider the graph shown in Figure 14(a). Assume that it is a part of an explicit graph G' 
(not shown fully) below a node n in it. Now two MESs below p 1 and p 2 are shown in Figure 14(b) 
and 14(c) respectively. The result of combining these two MESs to form an MES below n using 
the Construct_Qk routine is shown in Figure 14(e). Note that: 

(i) Although M(pi) and M(p 2 ) taken together create a cycle, the construction of M(n) is such 
that when p 2 is attempted to be included in it, the routine finds that p 2 s child x is already there 
from pi in the previous construction. Hence, it retains the MES below x as the MES below p 2 
also. This strategy helps in avoiding the cycle between the MES of pi and the MES of p 2 . 

(ii) While constructing M(n) from the MESs of its children p i: not all nodes from an MES 
below a child pi need be retained in the MES below n. For instance, in Figure 14(c) y is a node 
in MES M(p 2 ), but y does not appear in the combined MES M(n). However, as we've seen in the 
proof of the theorem, this does not introduce any error in the construction. 



Figure : 


4: Illustration of the Sub-problem Composition Theorem 
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Figure 15: Explicit graphs from G2 (Figure 11(b)) 




3.2.4 Computation of Costs for Explicit-graph MESs 

The costs of an MES in an explicit graph are defined in the same way as they are defined in the 
implicit graph. 

Definition 3.12 Given any explicit graph G' , for any MES M(n, G') we denote the cost of a node 
u in M(n, G') by f3(u, M(n, G')). We define this cost function f3(u, M(n, G')) as follows: 

P(u,M(n,G')) 
= if u E T; 
= 00 if u E NT; 

= h(u) if u e Zm(u,g') f\ u & {T\JNT), h(u) being the heuristic estimate at node u; 
= c(u,w) + /3(w, M(n,G')), if u is an OR node and w is the child of u in M(n,G'); 
= Hwev{u,M{n,G')){ciu } w) + (3(w , M (n, G'))} if u is an AND node. 

f3(u, M(n,G f )) is undefined if u G Zm{ti,G') \ Zg 1 - 

Definition 3.13 Let Mi(n, G'), M 2 (n, G'), . . ., M k {n, G') be the psgs below a node n in an explicit 
graph G' . Then h'{n), the cost of a minimal-cost psg below n = mm 1 <j< fc { / 9(n, Mj(n, G'))}; if there 
are no psgs below n in G' , h'(n) is undefined. 

Thus in Figure 15(a) h'(r) = 2, h'(ti) = 0, h'(q) = 5, h'(p) = 8 and h'(s) = 15, while in Figure 
15(b), h'(r), h'{p) and h\s) are undefined as there are no psgs below them in G' 2 . 



4 Algorithm SI 



We now present a best-first bottom-up algorithm, SI, that operates on the full implicit graph G. 
SI assumes that G contains only finitely many nodes and arcs. The set of terminal and nonterminal 
leaves of G, as well as the problem composition and decomposition rules, are needed as inputs to 
SI. It also maintains the h- value of a node to represent the currently known minimum cost of 
solving the node. SI uses two lists OPEN and CLOSED. SI starts by putting all the leaf nodes 
of G in OPEN with an /i-value of or oo according as the leaf node is terminal or non-terminal. 
SI then proceeds by removing a node from OPEN that has the minimum h- value. Whenever a 
node is removed from OPEN, it is put into CLOSED, and all its parents are obtained by applying 
inverse operators. These parents are checked for possible inclusion in OPEN and their /i-values 
are updated, if necessary, in an additive manner. 

The point that needs special mention here is the treatment of AND nodes. An AND node may 
be removed from OPEN only if all of its children have already entered CLOSED. This is ensured 
by using a label "eligible" to identify a subset of nodes in OPEN. The nodes that are eligible are 
either OR nodes, or AND nodes with all their children in CLOSED. 

SI continues in this manner until the start node is removed from OPEN or it is evident that 
the start node is of type-Ill. If the start node is not of type-Ill then h(s) equals the cost of a 
minimal-cost solution graph and is outputted by SI. Note that, by appropriately maintaining 
pointers, SI can trace the solution graph whose cost is outputted by it. For simplicity, those 
details have been left out in this paper. 

4.1 Algorithm SI 

51.1 Create a list, OPEN, and set OPEN = Z G (leaf nodes of G). For each n in OPEN, label n 
as "eligible". Now, if n is a terminal node, set h(n) = 0; else set h(n) = oo. 

51. 2 Create a list, CLOSED, that is initially empty. 

51.3 While (OPEN contains an eligible node) do 

51. 3.1 Find an eligible node n from OPEN which has the minimum h- value. (Resolve ties 
arbitrarily, but always in favour of the start node s.) Put n in CLOSED. 

51. 3. 2 If n — s then if h(s) = oo, terminate with FAILURE; else output h(s) and terminate 
with SUCCESS. 

51. 3. 3 Let pi,p2, ■ ■ ■ iPk be the immediate predecessors of n in G. For each pi, 1 < i < k, 
do the following: 

Case I: p { is an OR node 

If Pi is not already in OPEN or CLOSED, set hfa) = h(n) + c(p h n). Put p, L in OPEN 
and label pi as "eligible" ; elseif pi is already in OPEN with h(pi) > h(n) + c(pi, n) then 
set h(pi) = h(n) + c(p i: n). 
Case II: p^ is an AND node 

If pi is not already present in OPEN, put it in OPEN and set h(pi) = c(pi,n) + h(n); 

else set h(pi) = h(pi) + c(pi,n) + h(n). 

If all children of Pi are in CLOSED, label Pi as "eligible". 



SI. 4 Terminate with FAILURE. □ 
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Table 1: Working of SI on the graph of Figure 11(a) 

4.2 Working of SI 

We now illustrate the working of algorithm SI. SI is an uninformed search algorithm. Its working 
is shown on the implicit graph G of Figure 11(a). In the figure, the arcs are labelled with their 
costs and the heuristic values of nodes are shown in parenthesis. However, these heuristic values 
are not for use by SI. They are to be used by algorithm S2 which is presented later. The iteration- 
by-iteration working of SI is presented in Table 1. Nodes in CLOSED are shown inside a square 
box. Among the nodes in OPEN, the non-underlined nodes are those which are marked eligible, 
and underlined nodes are those which are yet to become eligible. The h-vahie of each node is 
superscripted. The node n shown in the second column is the node selected by SI from OPEN in 
each iteration. 

In step Sl.l, OPEN is created with nodes ti, t 2 and y having h- values 0, and oo respectively. 
All these nodes are marked "eligible" in OPEN. Then in the first iteration t\ is selected from 
OPEN and put in CLOSED with h(ti) = 0, and its parent p is inserted into OPEN with h(p) = 5. 
The snapshots of OPEN and CLOSED at the end of iteration 1 are shown in the first row with 
t\ inside a square box. In subsequent iterations p, r, q and s are selected from OPEN, one in 
each iteration, and put into CLOSED. Finally, SI terminates by finding a minimal-cost solution 
of cost 14 of the solution graph s,p,q,r,ti. 

4.3 Analysis of SI 

The following definitions will be used in proving some of the properties of SI. 

Definition 4.1 By an iteration of SI, we mean one complete execution of step SI. 3, i.e. of 
substeps SI. 3.1, SI. 3. 2 and SI. 3. 3 (unless SI terminates at SI. 3. 2, in which case the iteration 
consists of steps SI. 3.1 and SI. 3. 2 only). 

Definition 4.2 By an instant, we mean the time point when step SI. 3 is about to be executed. 
Thus at instant j , SI. 3 is executed for the j th time. 

Definition 4.3 Let p be a node in G, and M{p) be a type-I or type-II MES below p. At any 
instant during the execution of SI, a node n e M{p) is a leading node of M{p) if (\) n is in 
OPEN and (\\) all its successors in M{p), if any, are in CLOSED. 

Lemma 4.1 If M{p) is a type-I or type-II MES below a node p G G, then at any instant during 
the execution of SI, the following must hold: 

(a) If q is a leading node in M(p), then q must be eligible in OPEN; 

(b) If p CLOSED then there must exist at least one leading node of M{p). 



Figure 16: Illustration of Leading node 
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Proof, (a) Clearly, g must be in OPEN, with all its successors from M(jp) in CLOSED (by 
the definition of a leading node). This makes it eligible irrespective of whether it is an AND node 
or an OR node. 

(b) We sort the nodes of M(p) in topological order, based on their height values H in M(p) 
(Such a topological sorting of nodes of M(p) is possible, since it is an MES which is acyclic by 
definition). For any node y G M{p), its height H(y) in M{p) is defined as follows: 

H(y)= if y is a leaf node; 

= maxi<i< fc {l + H(yi)}, where yi,y 2 , —,Vk are children of y in M(p). 

Let the sorted list, in descending order of if-values, be called L, and let a sublist of L be 
V = L \ CLOSED, and let x be the rightmost node in L'. 
If a; is a leaf node, it is clearly a leading node of M(p). 

If x is an internal node in M{p), all its successors are in CLOSED (otherwise x could not be 
the rightmost node in V). Also, x must be in OPEN, as (i) x £ CLOSED, and (ii) all successors 
of x in M(p) are in CLOSED. Hence x must be a leading node of M(p). □ 

Remark 4.1 

If q G M(p) is eligible in OPEN then q need not be a leading node of M(p). This can clearly 
be illustrated from the following example. For the implicit graph G shown in Figure 16(a), we 
consider the MES shown in Figure 16(b). After the first two instants of SI, t 2 and t\ have travelled 
to CLOSED. However, although q G M(p) is eligible, it is not a leading node in M(p). Here x 
and r are leading nodes in M{p). 

Lemma 4.2 When SI runs on a finite AND/OR graph G, at any instant i, no type-Ill node ever 
enters CLOSED. 

Proof. This may be easily seen from the following statements. 

(a) Only eligible nodes from OPEN can enter CLOSED. 

(b) No eligible node can ever be of type-Ill. 

Statement (a) is clear from Step SI. 3.1 of algorithm SI ; the proof of (b) follows. The proof is 
by induction on the instant % of SI. 

For 2 = 1: The only eligible nodes are leaf nodes, which are of type-I or type-II. 

Up to i — k: We assume that no node that has become eligible is of type-Ill. 

i = k + 1 : Let n be the node selected from OPEN and sent to CLOSED at instant k and let 
p be a parent of n. Now if p becomes eligible in OPEN at instant k + 1, we show that p can not 
be of type-Ill. 

Case I: p is an OR node. Since, by assumption, n is of type-I or type-II, p would also be of 
type-I or type-II (by the Sub-problem Composition Theorem for Implicit Graphs). 



Case II: p is an AND node. Since p is now becoming eligible (by assumption), all the children 
Pi,P2, ■ ■ ■ ,Pi of p must now be in CLOSED, i.e. they must have been eligible at some instant prior 
to instant k + 1. By hypothesis, none of these children of p can be of type-Ill. Therefore, by the 
Sub-problem Composition Theorem for Implicit Graphs, p cannot be of type-Ill, either. □ 

Lemma 4.3 When SI sends a node n to CLOSED, h{n) = h*(n). 

Proof: Let ni be the node travelling to CLOSED at instant i, i = 1,2,.... We prove the 
lemma by induction on i. 

This lemma is applicable when a node travels to CLOSED. At instant 1, n± is clearly a leaf 
node with its h- value set to h* (0 or oo according as Hi is terminal or nonterminal). We assume 
that algorithm SI has put nodes ni,n 2 , . . . ,n k in CLOSED with h{rii) = h*(ni), 1 < i < k. We 
need to show that n k+ i goes to CLOSED with h{n k+ i) = h*{n k+ i). Clearly, by Lemma 4.2, n k+ i 
must be either of type-I or type-II. 

Case I : n k+ \ is a leaf node. Trivially true. 

Case II : n k+ \ is an AND node. Clearly, since n k+ \ is now eligible, all its children must have 
been previously put into CLOSED with their h- values = h* values (by hypothesis). Now, since 
SI computes the h- value of an AND node by successively adding the h- values of its children when 
each of them is selected from OPEN, then h(n k+ i) = h*(n k+ i) for the AND node n k+ \. 

Case HI: n k+ \ is an OR node. Let p be the child through which n k+ \ had last received its h- 
value prior to getting selected from OPEN, i.e. h{n k+ \) = c(n k+ \, p) + h(p) . If h(n k+ i) ^ h*(n k+ i), 
let q be the child of n k+ \ in a minimal-cost MES M below n k +i, i.e. h*{n k+ \) = c{n k+ \, q) + h*(q). 
And, it must be the case that q has not travelled to CLOSED yet. We shall show that this leads 
to a contradiction. 

Let £(q, M(n k+ \)) be the sub-MES below q in M(n k+ i). By Lemma 3.1, £(g, M(n k +i)) must 
be of type-I or type-II as M is of type-I or type-II. Moreover, by Lemma 4.1, as q has not entered 
CLOSED, £(g, M{n k+ \)) must have leading node(s) at instant k + 1. 

Let qi, q 2 , ■ ■ ■ , q m ( m > 1) be the leading nodes of £(g, M{n k+ i)) at instant k + 1. 

Now, from the definition of a leading node, each qj must have all its children from ^(g, M{n k+ i)) 
in CLOSED. Clearly, by the induction hypothesis, each of these children must have its h = h*. 

Further, £(g, M(n k+ \)) is a minimal-cost MES below q, and all the children of qj from ^(g, M(n k+ i)) 
have already travelled to CLOSED and updated the h(qj) value, if needed. Thus h(qj) = h*(qj), 
1 < j < m. 

Therefore, when n k+ i is selected from OPEN at instant k + 1, every qj, 1 < j < m, has, 

h(Qj) = h*(Qj) 

< h*(n k+l ) 

< c{n k+ i,p) + h*(p), as p is n k+ is child 

= c{n k+ i,p) + h(p), by induction hypothesis 
= h(n k+1 ), as assumed previously 

Therefore, when node n k+ i was selected from OPEN, qj, being a leading node, was also eligible 
in OPEN with h(qj) < h(n k +i). This is clearly in contradiction to the best-first node selection 
criterion, on the basis of minimum h, used by SI. □ 

Theorem 4.1 SI, while running on a finite AND/OR graph G: 

1. terminates with SUCCESS, outputting h(s) = h*(s) ifG contains at least one solution graph; 



2. terminates with FAILURE, otherwise. 



Proof. 

1. (G contains a solution graph.) As G contains only finitely many nodes, no node returns to 
OPEN from CLOSED, and in each iteration one eligible node is removed from OPEN and 
put into CLOSED, SI can continue for finitely many iterations only. 

Let M be a minimal-cost type-I MES below s in G. At any instant before s goes to CLOSED, 
there will always be at least one leading node n from M, by Lemma 4.1. n, being a leading 
node of a minimal-cost type-I MES, must have h(n) = h*(n) < h*(s). Note that n is also 
eligible in OPEN. Now n cannot be left in OPEN indefinitely, since SI runs for only finitely 
many iterations and selects eligible nodes from OPEN on the basis of minimum /i- value. 
Thus eventually n is bound to be selected from OPEN and put into CLOSED. When n = s, 
the algorithm will terminate with SUCCESS, outputting h(s) = h*(s). 

2. (G does not contain a solution graph.) If s is of type-II, consider the argument in (1) above. 
Here also the argument follows surrounding the key concepts of a minimal-cost type-II 
MES M below s, and the leading nodes of M. Ultimately s will enter CLOSED with 
h(s) = h*(s) = oo, and SI terminates with FAILURE. 

If s is of type-Ill, since no type-Ill node enters CLOSED, the algorithm will continue as long 
as there are type-I and type-II nodes in G. Since there are only finitely many such nodes, ul- 
timately OPEN will become empty of eligible nodes, and SI will terminate with FAILURE. □ 

Theorem 4.2 Let G be an AND/OR graph with finitely many nodes and arcs. When SI runs on 
G, it makes 

1. exactly Ni iterations, where 

Ni = 1+ | {n | n is a type-I node with h*(n) < h*(s)} \, if G contains a solution graph; 

2. at most A r 1 iterations, where 

Ni =| {n | n is a type-I or a type-II node } \, if G contains no solution graph. 

Proof. At each iteration of SI before it terminates, one (new) eligible node is selected from 
OPEN and put into CLOSED. 

1. If G contains a solution graph, i.e. s is of type-I, let M be a minimal-cost type-I MES 
below s. Then, at every iteration before s is selected, there will be at least one leading 
node n of M, such that n is eligible in OPEN with h(n) = h*(n) < h*(s). Thus, at every 
iteration i, before s is sent to CLOSED, if the node p is selected from OPEN at iteration i, 
h(p) < h(n) = h*(n) < h*(s) < oo (since G has a solution graph). Besides these iterations, 
one more iteration will be there with p = n = s and h{p) = h*(p) = h*(s). 

2. Since no type-Ill node enters CLOSED, the only nodes entering CLOSED are of type-I or 
type-II, and the lemma follows easily. □ 

Definition 4.4 By a node evaluation we mean a single computation of h-values at step SI. 3. 3. 
Similarly by a node selection, we mean selecting an eligible node from OPEN at step SI. 3.1. 



Theorem 4.3 Given any finite AND/OR graph G, the following are true about the execution of 
SI: 



1. SI makes O(Ni) node selections 

2. SI makes 0{N\Ki) node evaluations, where N\ is as defined in Theorem 4-2, and K\ is the 
total number of nodes in G. 

Proof. 

1. Clear, since there are O(Ni) iterations and in each iteration there is exactly one node selec- 
tion. 

2. There are O(Ni) iterations of step SI. 3, in each of which one node is selected and O(Ki) 
parents of a node may get evaluated. □ 



5 ALGORITHM S2 

S2 is an improved version of the uninformed search algorithm SI. It resembles AO* and does 
heuristically-guided search in a top-down fashion. S2 works on an implicit AND/OR graph G, 
which is either finite, or infinite containing at least one solution graph. Thus S2 can work effectively 
on graphs having paths of infinite length, which is not possible by SI due to its entirely bottom- up 
nature beginning from the leaf nodes. 

S2 contains a procedure BottomJJp that works in a manner identical to SI on explicit graphs. 
Additionally, S2 maintains a variable, front, with every node. The purpose of the variable front 
with any node n is to identify one of its successors q which is preferably an unsolved tip node of a 
least-costly psg below n. Thus in any iteration prior to the termination of S2, the front of s is an 
unsolved tip node of a least-costly psg below s, and is the candidate node to be expanded next. 

5.1 Algorithm S2 

52.1 Create an explicit graph G' consisting solely of the start node s. Set front(s) = s. If s is a 
terminal leaf set h(s) = 0; else if s is a nonterminal leaf set h(s) = oo. 

52.2 While ( (front(s) is not a terminal leaf) and (h(s) ^ oo) ) do: 

52.2.1 Let n = front(s). Expand n, generating all its children n\,ni, ■ ■ ■ ,n^. Install each 
rii in G' as child of n, by setting the arc (n, rij). For each newly occurring node rii in 
G' set front(rii) = n^. If is a terminal leaf set h{rii) = 0; else if is a nonterminal 
leaf set h{rii) = oo; else set h(rii) = h(rii). 

52.2.2 Set OPEN = Z G >. Label all the nodes in OPEN as eligible, and initial. 

52.2.3 Call BottomJJp(OPEN). 

52.3 If front(s) is a terminal leaf node, output h(s) and terminate with SUCCESS; else terminate 
with FAILURE. 

PROCEDURE BottomJJp (List OPEN) 
Bl Initialize a list, CLOSED, to nil. 

B2 While (OPEN contains an eligible node and s £ CLOSED) do: 

B2.1 Select an eligible node q from OPEN that has minimum /i- value. (Resolve ties arbi- 
trarily, but always in favour of s). 



B2.2 If q is not an initial node, then do the following: 

Let qi, q2, ■ ■ ■ , q r be the children of q in G' which are in CLOSED. 
Case I: q is an OR node. 

Let r = mini<i< r {c(q, g«) + h^qi)} occur for i — j (resolve ties arbitrarily, but in favour 
of a node whose front is a terminal leaf). Set front(q) = front(qj). 
Case II: q is an AND node. 

Let qj be the leftmost child of q whose front is not a terminal leaf. If no such qj exists 
(i.e. every child of q has a terminal leaf as its front) set front(q) = front(qi); else set 
front(q) = front(qj). 

B2.3 Put q in CLOSED. Let Pi,P2, ■ ■ ■ ,Pk be the parents of q in G' . For each do: 
Case I: pi is an OR node. 

If Pi is not already present in OPEN or CLOSED, set h(pi) = h(q) + c(pi,q). Put 
Pi in OPEN and mark it eligible; elseif Pi is already present in OPEN with h{pi) > 
h(q) + c(pi, q), set hfa) = h(q) + c(p i: q). 
Case II: p,i is an AND node. 

If pi is not already present in OPEN, put it in OPEN and set h(pi) = h(q) + c(pi, q); 
else set h(pi) = h(pi) + c(pi, q) + h(q). If all children of pi are in CLOSED, mark pi as 
eligible. 

B3 Remove any remaining nodes from OPEN. 
B4 If s CLOSED set h(s) = oo. □ 

5.2 Working of S2 

In Tables 2 and 3, we present the working of S2 on the graphs of Figures 11(a) and 11(b). Each 
iteration of S2 is quite similar to the working of SI as presented in Table 1. Here, n is the node 
which is expanded in each iteration. With each node, an additional variable "front"' is added. Tip 
nodes have themselves as their fronts; other (internal) nodes initially have their fronts "carried 
over" from the previous iteration, and later the fronts are decided when these internal nodes 
enter CLOSED. In each iteration, the first row of column three shows the tip nodes of the explicit 
graph, that are initially put in OPEN and labeled as "eligible" and "initial". Nodes that are not 
eligible are underlined. CLOSED nodes are put in rectangular boxes. The h and front values 
of a node are superscripted above it. S2 makes use of heuristic values and runs in the top-down 
fashion. For the graph G\ in Figure 11(a), S2 outputs h*(s) = 14, i.e. the cost of a minimal-cost 
solution graph. For the graph G 2 in Figure 11(b), S2 terminates with FAILURE as there is no 
solution graph below s. 

5.3 Analysis of S2 

The results on the correctness and complexity of S2 are presented below. In this discussion, by a 
"Bottom- Up computation" we shall mean a call to the Bottom- Up procedure during an iteration 
ofS2. 

Definition 5.1 A heuristic function h>0 defined on the nodes of G is said to be admissible if 

h(n) < h*(n) for all type-I or type-II nodes in G. 

Definition 5.2 An execution of S2.2 (i.e. substeps S2.2.1, S2.2.2 and S2.2.3) is called an itera- 
tion of the algorithm S2. 

Definition 5.3 Given any iteration of S2, by an instant within it we refer to the time point when 
substep B2 of step S2.2.3 is about to be executed. 
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Table 2: Working of S2 on the graph of Figure 11(a) 
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Table 3: Working of S2 on the graph of Figure 11(b) 



Thus at instant j of an iteration, the substep B2 is executed for the jth time. 
Remark 5.1 

An iteration of SI contains exactly one instant, while an iteration of S2 will have many instants 
within it. 

Definition 5.4 Let p be a node in G' , and M(p) be a psg below p in G' . During any iteration of 
S2, a node n e M{p) is a leading node of M{p) at instant j if (i) n is in OPEN and (ii) all its 
successors in M{p), if any, are in CLOSED at that instant j . 

Lemma 5.1 During an iteration of S2, let M{p) be a psg below a node p G G' . Then at any 
instant of that iteration, the following must hold: 

(a) If q be a leading node in M(p), then q is eligible in OPEN; (b) Conversely, ifp (jL CLOSED , 
there will exist at least one leading node of M{p). 

Proof. Similar to the proof of Lemma 4.1. □ 

Lemma 5.2 In any iteration of S2, when a node n is sent to CLOSED during the bottom-up 
computation, the followings hold: 

1. h(n) = h'(n). 

2. front(n) = a tip node of a minimal-cost psg below n. 
Proof. 

1. By double induction. First on iteration i, and then on the nodes of the explicit graph which 
are sent to CLOSED in that iteration. 

Induction Basis. At iteration % — 1, s is expanded. Let rii, n 2 , . . . , n^, be those children of 
s which are tip nodes and are put in OPEN. 

Now for each rij, 1 < j ' < k, h(rij) equals 0, oo or h(rij) according as rij is a terminal leaf 
node, a nonterminal leaf node or an internal node of G. Since each rij is a tip node, whichever 
psg they may belong to, h(rij) = h!(rij), 1 < j < k (from the definition of h'). Whenever 
any of these nodes travels to CLOSED, it will have h = h! . 

Now we show that the result holds for s. 

If s is an AND node 

Case I: s did not enter CLOSED Clearly, by step B2 of the algorithm, s was not eligible. 
Then, among the children of s, at least one did not enter CLOSED. Since this is the first 
iteration and s is the only expanded node, this case is possible only if there is a self-loop 
at s, implying that s is of type-Ill. Thus s did not enter CLOSED, and the lemma holds 
trivially. 

Case II: s entered CLOSED . Since s is an AND node, all its children must have been sent to 
CLOSED prior to s becoming eligible. Now, S2 sends each child of s, nj, to CLOSED with 
h{rij) = h'(rij) (as rij is a tip node). Hence from the definition of h', clearly, h(s) = h'(s) 
when s is sent to CLOSED. 

If s is an OR node 

Case I: s did not enter CLOSED. Then the lemma is vacuously true. (Note that, since s is 
an OR node, if any of its children had entered OPEN, eventually s would also have entered 



OPEN, and then, CLOSED. Then it must be that no child of s even entered OPEN. This is 
possible only if s is a type-Ill node and there is just a self-loop from s.) 

Case II: s entered CLOSED. Then some child(ren) of s, prior to s itself, must have entered 
OPEN, and then CLOSED. Let M be a minimal-cost psg below s in the explicit graph of 
iteration 1. Let Z M = {n p }. Let us assume that s entered CLOSED with its /i-value defined 
by some child n q ^ n p and h(s) > h'(s). Then h(s) > h'(s) = c(s,n p ) + h'{n p ) > h'{n p ) = 
h(n p ) (since n p e Zm)- Then clearly, n p remained in OPEN when s got selected and sent 
to CLOSED. This is in contradiction to the criterion of node selection from OPEN on the 
basis of minimum fa-value. 

Thus the lemma holds for all nodes entering CLOSED in iteration 1. 

Induction Hypothesis. Let the lemma be true up to iteration % — I. 

Induction Step: i — I + 1. Let the nodes that enter CLOSED be ni,n2, . . .. Clearly, the 
first node, n 1; that enters CLOSED from OPEN must be a tip node of the explicit graph 
at instant I + 1, for which h{rii) = 0, oo, or h{rii) according as n x is a terminal leaf, a 
nonterminal leaf, or an internal node of G. Thus h(rii) = h'(ni) for the tip node n\. 

Let us assume that the lemma holds up to the /cth node at instant / + 1, i.e. nodes ni, . . . n k 
enter CLOSED with h{rtj) = h'(nj), 1 < j < k. 

We need to show that n k +i goes to CLOSED with h(n k +i) = h'(rik+i). 
Case I: rik+i is a tip node. Trivially true. 

Case II: rik+i is an internal node. There can be two subcases within this. 

Case 11(a): n^+i is an AND node. Clearly, rik+i must be eligible and all the children of nt+i 
must have been previously put into CLOSED, otherwise n^+i could not have become eligible. 
Hence n^+i's children must occur among rii, . . . , and have h- values = h', according to the 
induction hypothesis. Now, since S2 computes the /i-value of an AND node by successively 
adding the /i-values of its children when each of them is selected from OPEN, it is clear that 
h(nk+i) = h'{rik+i) for the AND node n^+i. 

Case 11(b): nt+i is an OR node. Let p be the child through which nk+i had last received 
its /i-value prior to its getting selected from OPEN, i.e. h{rik+i) = c{nk+i,p) + h{p). If 
h(nk+i) 7^ h'(rik+i), let q be the child of rik+i in a minimal-cost psg M below nk+i, i.e. 
h'(nk+i) = c(nk+i, q) + h'(q). We shall show that this leads to a contradiction. 

Since q has not yet entered CLOSED, the MES M(n£. +1 , G') that contains q must have leading 
nodes, by Lemma 5.1. Let the leading nodes of M(nk+i, G') be qi, q 2 , ■ ■ ■ , q m (m > 1). Since 
M(nk+i,G') is a minimal-cost MES, we have h(qj) = h'(qj), 1 < j < m. 

Thus every qj, 1 < j < m, has, when n fe+1 is selected from OPEN, 

h(qj) = ti(qj) 

< h'(n k+ i) 

< c(nk+i,p) + h'(p), as p is n^+i's child 

= c(n k+1 ,p) + h(p), by induction hypothesis 
= h(n k+1 ), as assumed previously 

Therefore, when node n k +\ was selected from OPEN, qj, being a leading node, was also 
eligible in OPEN with h(qj) < h(nk+i). This is clearly in contradiction to the best-first node 
selection criterion, on the basis of minimum h, used by S2. Hence the result. 

2. As in (1), the proof is by double induction, first on the iteration and then on the nodes of 
G' entering CLOSED in that iteration. We simply give an outline of the proof below. 



If a tip node n enters CLOSED, n has itself as its front, and the lemma is trivially true, as 
n is the only node in a minimal-cost psg below it. 

When an internal node n goes to CLOSED, h(n) is set to h'(n), (the cost of a minimal-cost 
psg below n), as proved in part (1) above. Now, when n enters CLOSED, its front is decided 
in step B2.2. If n is an OR node, front(n) is set to the front of one of its children which, in 
turn, defines h'(n). Note that h'(n) is the cost of a minimal-cost psg below n. Thus front(n) 
becomes a tip node of a minimal-cost psg below n. If n is an AND node, its /i'(n)-value is 
computed by adding the Zi'-values of all the children and the costs of the arcs connecting 
them with n. Now, front{n) is set to the front of one of the children of n, which in turn, 
is a tip node of a minimal-cost psg below it. □ 

Lemma 5.3 If s is not of type-Ill, then at the end of every Bottom-up computation, it must enter 
CLOSED. 

Proof. In any iteration, given the explicit graph, if s is not of type-Ill, there will exist psgs 
below s in G' . Let M ± , M 2 , . . . , M k be the all possible psgs rooted at s. Let Mj, 1 < j < k, be 
a minimal-cost psg below s with cost h'(s). Now it is easy to show that eventually s must enter 
CLOSED. At each instant, (i.e. execution of step B2 of Bottom-Up computation), a distinct node 
is selected from OPEN and put into CLOSED. Once a node enters CLOSED, it never returns to 
OPEN. Since in any iteration, there are only finitely many nodes in G', it is clear that the Bottom- 
Up computation cannot continue indefinitely. On the other hand, the Bottom-Up computation 
cannot get stuck since there must exist a leading node from Mj which is eligible (Lemma 5.1), prior 
to sending s to CLOSED. Thus it is clear that, after finitely many instants of a Bottom-Up compu- 
tation, s will become the leading node of Mj and will eventually be sent from OPEN to CLOSED. □ 

Lemma 5.4 Under admissible heuristics, at the end of Bottom- Up computation of every iteration 
of S2, we have h'{n) < h*{n), where n is a type-I or type-II node in G and is currently included 
mG'. 

Proof. Let M be a minimal-cost MES below n in G, and let M' be the portion of M contained 
in G'. Thus, M' is a psg below n in G'. Now clearly the following observations will hold: 

(a) From the definition of h', h'(n) < (3{n, M'), since M' is just one of the psgs below n and 
M' need not define h'(n), and (b) f3(n,M') < /3(n, M) = h*(n), by the definition of (3 (Definition 
3.12) and the admissibility of the heuristic function (Definition 5.1). 

Now combining (a) and (6), the lemma follows. □ 

Lemma 5.5 In any iteration of S2, during the BottomJJp computation, no type-Ill node in G' 
ever enters CLOSED. 

Proof. The proof is similar to that of Lemma 4.2 for SI. Note that the Sub-problem Com- 
position Theorem for Explicit Graphs (i.e. Theorem 3.2) is used in place of the Sub-problem 
Composition Theorem for Implicit Graphs (Theorem 3.1). □ 

Theorem 5.1 // the implicit graph G has at least one solution graph, then S2 running with 
admissible heuristics terminates with SUCCESS by outputting h(s) = h*(s). 

Proof. Let G be any AND/OR graph containing at least one solution graph. Since node 
branching factor is finite, there are only finitely many psgs M below s which are subgraphs of G, 
having costs f3(s,M) < h*(s). Now, combining Lemmas 5.2 and 5.4, h(s) = h'(s) < h*(s) (s being 



a type-I node, as it has a solution graph below it) at the end of each iteration of S2. Now by 
Lemma 5.2, at the end of each iteration, front(s) is set to a tip node of a minimal-cost psg below 
s, and this front(s) is expanded in the next iteration. Since every front(s) is unique and arc-costs 
are positive, after finitely many iterations there will be no more psgs having costs < h*(s). Thus 
unless S2 terminates, it has to continue the search with psgs having costs > h*(s), which is in 
contradiction to the Lemma 5.4. Thus S2 must terminate after finitely many iterations. 

Again, as s is of type-I, h*(s) < oo, implying that at the end of each iteration h(s) < oo. 
Hence S2 cannot terminate with FAILURE. 

Hence the alternative termination condition namely, front(s) is a terminal leaf, must hold. 
Let M be the psg below s that sets front(s) to be a terminal leaf. Then Zm cannot contain a 
non-leaf tip node (otherwise front(s) could not be a terminal leaf, from the step B2.2 of S2). Now 
Zm cannot contain any nonterminal leaf either, as it would otherwise violate the fact that s is a 
type-I node and front(s) is a tip node of a minimal-cost psg below s. 

Therefore, M must be a solution graph, and h(s) = /3(s, M) > h*(s). 

Combining this with h(s) = h'(s) < h*(s) at the end of every iteration (Lemma 5.2 and Lemma 
5.4), we have h(s) = h*(s) at termination of S2. □ 

Theorem 5.2 S2 terminates with FAILURE on a finite AND/OR graph G that does not contain 
a solution graph. 

Proof. Let G be a finite AND/OR graph that does not contain a solution graph. While 
S2 runs on G, in its every iteration a node called front(s) is expanded. But in every iteration, 
front(s) is a distinct node, i.e. a tip node of a minimal-cost psg below s. Since G is finite, S2 can 
run for finitely many iterations. 

The termination of S2 can happen either when front(s) is a terminal leaf or when h(s) = oo. 
Now, if front(s) is a terminal leaf, S2 must have obtained a solution graph below s. This contra- 
dicts the fact that G does not contain a solution graph. Hence, the other condition for termination 
namely, h(s) = oo must hold, implying that S2 terminates with FAILURE. (This FAILURE ter- 
mination can happen in either of two ways, s could be a type-II node in G, in which case h(s) is 
set to oo in step B2.3. Alternatively, s could be a type-Ill node in G, in which case it does not 
travel to CLOSED at the end of a Bottom-Up computation and has its h-value set to oo at step 
B4. In either case S2 terminates with FAILURE.) □ 



5.4 Complexity Analysis of S2 

In this section, we present the complexity analysis of S2. 

Definition 5.5 (i) Let G be an AND/OR graph. We define a set of nodes V as follows: 

1. If G has at least one solution graph, then: 

(a) s is in V if s is not a terminal or nonterminal leaf node; 

(b) a node n is in V if n is not a terminal leaf node, and if there exists a psg M below s in 
some explicit graph G' for G such that n is a tip node in M and f3(s, M) < h*(s). 

2. If G has no solution graph, but it is finite, then every non-leaf node n G G will belong to V . 
(ii) Let N 2 =\V \ . 

Theorem 5.3 Let G be an implicitly defined AND/OR graph, such that: (a) G contains at least 
one solution graph or (b) G is finite. Now when S2 runs on G with an admissible heuristic, the 
followings are true: 



1. S2 requires 0(K 2 ) = 0(N 2 ) storage; 



2. S2 makes at most N 2 + 1 iterations; 

3. S2 runs in 0(N 2 K$) = 0(N%) time; 

where N 2 is as in Definition 5. 5, and K 2 is the total number of nodes in the explicit graph G' in 
the last iteration of S2 (K 2 < b * N 2 , where b is the maximum branching factor of a node, which 
is finite). 

Proof. 

1. S2 stores the entire explicit graph having K 2 nodes, with 0(b) information at each node. As 
b is finite, the result follows. 

2. (a) If s is a terminal leaf node, S2 will terminate in the first iteration. Otherwise, S2 continues 
the search by expanding front(s) in every iteration. Thus, as long as S2 does not terminate, 
it must be the case that front(s) is neither a terminal node, nor is it a nonterminal node. 
Moreover, front(s) will be assigned to a distinct node in every iteration of S2. 

Thus at each iteration prior to termination, S2 expands a distinct node, say n. Now, from 
Lemma 5.2(2), n is a tip node of a minimal-cost psg M below s. Such a psg will have cost 
(3(s,M) = h'(s) < h*(s) (Lemma 5.4). Thus the expanded node n (= front(s)) will be one 
of the nodes of V. Thus prior to termination, S2 makes at most N 2 node expansions in as 
many iterations, plus one more iteration for termination. 

(b) If s is a nonterminal leaf node, S2 will terminate in the first iteration. Otherwise, it 
continues the search by expanding a distinct node as assigned to front(s) in every iteration. 
Since there are N 2 internal nodes in G, S2 can make at most N 2 expansions in N 2 iterations 
before terminating in the (N 2 + l)th iteration. 

3. S2 makes N 2 + 1 iterations, from (2) above. In each iteration (prior to the terminating one), 
it does the followings: 

(a) (Step S2.2) Checks for termination, in 0(1) time; 

(b) (Step S2.2.1) Expands a node and generates its children, in 0(b) time (where b is the 
maximum branching factor of a node, which is finite); 

(c) (Step S2.2.2) Creates OPEN, in 0(K 2 ) time, since there are K 2 nodes in G'; 

(d) (Step S2.2.3) Calls Bottom-Up; in each call, it: 

i. (Step Bl) Creates CLOSED, in 0(1) time; 

ii. (Step B2) Checks conditions, in 0(1) time, and makes 0(K 2 ) iterations of the 
following steps: 

A. (Step B2.1) Selects an eligible node from OPEN, in 0(K 2 ) time; 

B. (Step B2.2) Decides front, in 0(b) time; 

C. (Step B2.3) Evaluates the parents of the selected node, in 0(K 2 ) time; 

iii. (Step B3) Clears OPEN, in 0(1) time; 

iv. (Step B4) Checks whether s E CLOSED, in 0(1) time. 

Thus the overall time complexity of S2 is 

0(N 2 (1 + b + K 2 + (1 + 1 + K 2 (K 2 + b + K 2 ) + l + 1))) = 0(N 2 Kj) = 0(b 2 Nl) = O(Nl). 



□ 



6 Experimental Results 



The algorithms SI, REV*, S2, CFCrev* and AO* have been empirically compared on a DEC- 
Alpha workstation. All algorithms except CFCrev were programmed in C++. For CFCrev*-, 
the C-code was obtained from the website mentioned in [Jimenez and Torras 2000]. The experi- 
mental results obtained are now discussed. 

First the AND/OR graphs (problem instances) are generated using the following parameters: 
the total number of nodes in the implicit graph, the percentage of AND nodes, and whether the 
graph is cyclic or acyclic. For every combination of these parameters, the relevant algorithms are 
run over a set of one hundred randomly generated graphs. The average time of execution and the 
average number of nodes evaluated in each case (i.e. over 100 problems) are noted in tables 4 and 
5. The time is mentioned in CPU Clock Ticks, where 1 Clock Tick = 10~ 6 sec. The number of 
node evaluations is the number of times the cost of a node is computed during bottom-up phase 
(one bottom-up phase for SI and REV*, multiple bottom-up phases for S2, CFCrev and AO*.) 
The tables 4 and 5 show a snapshot of the experimental results. As it was found that variations 
in heuristic estimate or node branching factor do not reveal any new information, the heuristic 
estimate was kept at 90% to 100% of the solution cost for each node and the node branching factor 
was kept fixed at 3. 

One characteristic of the graphs we used in our experiments is that, the start node is always 
type-I - i.e. it is not of type-II or type-Ill. The reason for choosing this is that if the start node 
is of type-II or type-Ill, it is assigned a very high heuristic value by our heuristic computation 
program. In that case, heuristic search algorithms like S2 or CFCrev find the /i-value of start 
node to be oo (in the first iteration itself) and exit from the problem almost immediately, making 
a fair comparison with SI or REV* impossible. For this reason, the graphs that do not have a 
solution graph below s are discarded from our set of hundred problems in each case. 

From the tables, the following observations can be made: 

1. the smallest execution time is taken by REV* among uninformed algorithms and by CFCrev* 
among heuristic search algorithms; 

2. SI makes less node evaluations than REV*. This is expected, as SI is designed to work in a 
best-first manner. But the same effect is not observed between S2 and CFCrev* - actually, 
S2 makes more node evaluations than CFCrev*- While this may appear to be contradictory, 
the reason is that CFCrev* operates on a much smaller size of OPEN (compared to S2) 
in each iteration. Ultimately, this effect dominates over the best-first nature of S2. It is 
interesting that AO* makes even less node evaluations (than CFCrev*) on acyclic graphs. 
This is because, AO* employs the best-first principle along with a smaller set of initial nodes 
(in Z-list) and thereby gains in node evaluations over both S2 and CFCrev- 

3. In spite of evaluating less nodes than CFCrev*, AO* takes more time to execute. This is 
because, the time taken for predecessor-checking during the bottom-up computation in AO* 
is quite high and dominates over the time for node evaluations; 

4. S2 makes more node evaluations compared to CFCrev*, but takes much less time per node 
than CFCrev*- Thus S2 does less work per node compared to CFCrev*, which is also 
clear from the design of the algorithms. 

5. It was observed that under identical tie-resolution strategy, S2, AO* and CFCrev* expand 
the same number of nodes while running on acyclic graphs. The same is true about S2 and 
CFCrev* 011 cyclic graphs. The data has been omitted from this presentation. 
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Table 4: Performance of Algorithms on Acyclic Graphs 



7 Summary and Future Work 

Best-first search in cyclic AND/OR graphs had been a long- unresolved problem of artificial intel- 
ligence. Over the last few years, a number of studies have been reported on this topic. However, 
all of these studies lacked an unified theoretical framework for both cyclic and acyclic AND/OR 
graphs, and this led to a lacuna in the theoretical proofs of those algorithms. In this paper, we 
have taken a fresh look at the problem. First, a new and comprehensive framework for cyclic 
AND/OR graphs has been presented, which should be of use to future researchers as well. Then 
two best-first algorithms, SI and S2, have been developed for searching AND/OR graphs in the 
presence of cycles. The new theoretical framework has been useful in establishing the correctness 
and complexity results of SI and S2 in detail. 

SI and S2 have been implemented on a DEC-Alpha Workstation, and a large number of 
experiments on randomly-generated graphs have yielded correct results in all cases. However, 
computational times show that the execution time of S2 is not favourable compared to that 
of CFCrev* (on the same set of random graphs and heuristic distribution). Again, the node 
evaluations of CFCrev* is not favourable compared to that of AO* (on acyclic graphs). Clearly, 
the design of an algorithm that has the best-first nature of S2, the time performance of CFCrev* 
and the node performance of AO* remains a research challenge for the future. 
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Figure 17: opt (A, s) collapses 



1 




t 




1 



t 



(a) Implicit Graph A 



(b) Implicit Graph A 



9 Appendix 

The problem of cyclic AND/OR graph search has attracted considerable research attention in 
recent times [Chakrabarti 1994; Hvalica 1996; Jimenez and Torras 2000]. However all these 
attempts have overlooked certain important issues, which underline the fundamental nature of the 
problem. We briefly review these recent work in this section. 



[Chakrabarti 1994] has suggested a definitional framework and two algorithms, Iterative_Revise 



Nilsson's statement "This recursive definition is satisfactory because we are assuming acyclic 
graphs" [Nilsson 1980, pp. 102] was a warning in disguise that for cyclic AND/OR graphs, it may 
be difficult to have a formalism based on recursive structures. [Chakrabarti 1994] defined the basic 
structure opt(A,n) (pp. 331) which enters into an infinite recursion. [Chakrabarti 1994] used the 
following notation: 

A: Implicit AND/OR graph 

opt(A, n): Cost of a minimal-cost solution graph below node n in A 
t{n): Non-negative cost at terminal node n 
D{n): Solution graph below n 

We now reproduce the definition of opt(A,n) from [Chakrabarti 1994]: 

For every AND/OR graph A, the quantity opt(A, n) is defined recursively as follows: 

opt(A,n) = t{n), if n is a terminal node in A; 

= oo, if there does not exist any solution graph D(n) in A; 

= mini<i<k{opt(A, ni) + c(n, rij)}, for OR node n with immediate 

successors rij, 1 < i < k] 
= zLi<i<£i{0£>t(A Hi) + c(n,ni)}, for AND node n with immediate 

successors rij, 1 < i < k. 

We now illustrate the severe drawback in this definition on the implicit graphs A of Figure 17. 

On the OR graph of Figure 17(a), opt(A,s) = min{opt(A, s) + c(s, s),opt(A,t) + c(s,t)}, 
which leads to an infinite recursion. Similarly on the AND/OR graph of Figure 17(b), opt(A, s) = 
[{opt(A,p)+c(s,p)} + {opt(A,t)+c{s,t)}] = \{min{opt(A, s)+c{p, s),opt(A,t)+c{p,t)}+c{s,p)} + 
{opt(A,t) + c(s,t)}], which again leads to an infinite recursion. 



9.1 Algorithm REV* 



and REV*. 



Scenario 1. Basic structure collapses 



Scenario 2. Best-first principle violated 



Figure 18: REV* selects nodes with oo cost, while h*(s) = 2 




OPEN: /°, /°, y°°J 2 OPEN: /°, /°, y-,^ 2 

Found: t\, tij, g 5 , -2 10 , s 2 Found: *?, ^, y°°, p 1 , q°°, z°°, s 2 

(a) Implicit OR Graph A (b) Implicit AND/OR Graph A 



We now turn to the algorithms presented in [Chakrabarti 1994]. Of the two algorithms, REV* 
is an improved version of the depth first algorithm Iterative_Revise. REV* performs bottom-up 
search by treating the implicit graph in an explicit fashion. Starting the search from the leaf 
nodes, REV* applies inverse operators, using a list OPEN on the way. 

However, when REV* selects a node n from OPEN, it immediately tries to select and evaluate 
its predecessor nodes, without letting these predecessors to enter OPEN. This causes serious 
violations of the best-first criterion and as a result it ends up selecting nodes with higher (or 
00) costs than the minimum cost of a node in OPEN. This paradoxical phenomenon is clearly 
portrayed in the two examples given in Figure 18. 

Figure 18(a) presents a simple OR graph, for which a best-first search algorithm like Dijkstra's 
will never visit nodes q with cost 5, and z with cost 10. But REV* works as follows: initially it puts 
in OPEN the "found" nodes t±, t 2 and y with costs 0,0 and 00 respectively. Then, after selecting 
ti, it continues its upward computation through the "found" node p up to s, and inserts s with 
cost 2 (i.e. the cost of a minimal-cost solution graph) in OPEN. Next the node t 2 is selected. Now, 
although s awaits in OPEN with cost 2 for selection, REV* proceeds upwards from t 2 , selecting 
"found" nodes q and z, with costs 5 and 10, ignoring the legitimate superior candidacy of s in 
OPEN. 

Figure 18(b) depicts a similar situation in presence of an AND node q. Here, nodes q and z, 
with 00 costs, will get preference over s with cost 2 in OPEN. 

Scenario 3. Basic theorem fails 

The most severe fallout of REV*'s violation of the best-first principle is that, the Theorem 
5.3(iii) of [Chakrabarti 1994] fails, where it was claimed that "algorithm REV* examines all 
those nodes in OPEN for which opt(n) < opt(s)." This can be readily verified from the graph of 
Figure 19. On this graph, REV* starts by putting nodes t±, x, t 2 and t 3 , with UB-values 0, 00, 
and respectively, into OPEN. Then after removing t\ and inserting n with UB(n) = 10 into 
OPEN, REV* removes t 2 and inserts p with UB(p) = 1 in OPEN. Next, it selects t 3 from OPEN, 
but cannot declare s "found" as p is not yet "found" . Finally, it selects p from OPEN, declares p as 
"found" and then, declares s as "found", too, with UB(s) = 102. Then REV* terminates, without 
selecting n from OPEN which has opt(n) = 10 < opt(s) = 102. This is a clear contradiction to 
the Theorem 5.3(iii) of [Chakrabarti 1994], according to which REV* should also have selected n 
from OPEN, before termination. 

9.2 Hvalica's Method 



In a recent paper [Hvalica 1996] has attempted to solve cyclic AND/OR graphs by attaching an 
arc to a new dummy node Xf, with a high heuristic value, from the node currently being expanded. 



Fi gure 19: REV* theorem fa ils 




Implicit Graph A 



Fig ure 20: Hvalica's method lo ops 




Implicit Graph G 



This method is based on the premise that, even if expansion of the current node creates a cycle, 
the algorithm can come out of the cycle by looping through it a sufficient number of times (when 
the cost of the expanded node, computed through its children, exceeds the high cost attached to 
the dummy child xj.) [pp 108, Hvalica 1996]. In the example of Figure 20, this method will make 
a large number (H) of unnecessary evaluations of node p, where H represents a very high value. 
This method clearly violates the best-first search principle. Even if a node n has a solution graph 
below it (and a self-loop of unit cost), and a choice of H — h*(n) is used when expanding n, there 
will exist cases where the looping at n will violate the best-first principle globally, with respect 
to some other less-costly node of the graph. Similarly we can construct cases where the looping, 
even though not violating the best-first principle, results in unnecessary computations. 

9.3 Algorithm CFC REV * 

As mentioned by the authors [Jimenez and Torras 2000], the algorithm CFC RE v* has been de- 
signed primarily keeping the efficiency in mind. While that objective seems to have been achieved 
(as observed in the Experimental Results), the algorithm lacks any clear theoretical framework. 
The paper has used the "standard notation and definitions stated in" [Mahanti and Bagchi 1985] 
which, however, was written for acyclic AND/OR graphs. The problems that arise in this situation 
(i.e. the infinite recursion in cyclical definitions) has been discussed in detail under REV*, so we 
refrain from repeating that here. We only observe that, given the absence of a correct theoreti- 
cal framework, the correctness proofs of algorithm CFCrev* stand on a weak base. Again the 
best-first nature of the algorithm, which was shown to be violated in the case of REV*, is easily 
violated in the case of CFCrev* as we U- This is precisely the reason why CFCrev* evaluates 
many more nodes than AO* on acyclic graphs, which is based on best-first search. 



